Language Accent Classi cation in American EnglishxLevent
نویسنده
چکیده
It is well known that speaker variability caused by accent is one factor that degrades performance of speech recognition algorithms. If knowledge of speaker accent can be estimated accurately, then a modiied set of recognition models which addresses speaker accent could be employed to increase recognition accuracy. In this study, the problem of language accent classiication in American English is considered. A database of foreign language accent is established that consists of words and phrases that are known to be sensitive to accent. Next, isolated word and phoneme based accent classiication algorithms are developed. The feature set under consideration includes Mel-cepstrum coeecients and energy, and their rst order diierences. It is shown that as test utterance length increases, higher classiication accuracy is achieved. Isolated word strings of 7-8 words uttered by the speaker results in an accent classiication rate of 93% among four diierent language accents. A subjective listening test is also conducted in order to compare human performance with computer algorithm performance in accent discrimination. The results show that computer based accent classiication consistently achieves superior performance over human listener responses for classiication. It is shown however, that some listeners are able to match algorithm performance for accent detection. Finally, an experimental study is performed to investigate the innuence of foreign accent on speech recognition algorithms. It is shown that training separate models for each accent rather than using a single model for each word can improve recognition accuracy dramatically. PACS Code 43.72 z Permission is hereby granted to publish this abstract separately.
منابع مشابه
Foreign accent classification using source generator based prosodic features
Source Generator Based Prosodic Features John H.L. Hansen and Levent M. Arslan Robust Speech Processing Laboratory Duke University Department of Electrical Engineering Box 90291, Durham, North Carolina 27708-0291 ABSTRACT Speaker accent is an important issue in the formulation of robust speaker independent recognition systems. Knowledge gained from a reliable accent classi cation approach could...
متن کاملAutomatic annotation and classification of phrase accents in spontaneous speech
During the last years, we have been working on the automatic classi cation of boundaries and accents in the German VERBMOBIL (VM) project (human-human communication, appointment scheduling dialogues). A sub-corpus was annotated manually with prosodic boundary and accent labels, and neural networks (NN) trained with a large set of prosodic features were used for automatic classi cation. The clas...
متن کاملFrequency characteristics of foreign accented speech
In this study, frequency characteristics of foreign accented speech is investigated. Experiments are conducted to discover the relative signi cance of di erent resonant frequencies and frequency bands in terms of their accent discrimination ability. It is shown that second and third formants are more important than other resonant frequencies. A lter bank analysis of accented speech supports thi...
متن کاملAn elitist approach to automatic articulatory-acoustic feature classi cation for phonetic characterization of spoken language
A novel framework for automatic articulatory-acoustic feature extraction has been developed for enhancing the accuracy of placeand manner-of-articulation classi cation in spoken language. The ‘‘elitist’’ approach provides a principled means of selecting frames for which multi-layer perceptron, neural-network classi ers are highly con dent. Using this method it is possible to achieve a frame-lev...
متن کاملIntegrated dialog act segmentation and classification using prosodic features and language models
This paper presents an integrated approach for the segmentation and classi cation of dialog acts (DA) in the Verbmobil project. In Verbmobil it is often su cient to recognize the sequence of DAs occurring during a dialog between the two partners. In our previous work [5] we segmented and classi ed a dialog in two steps: rst we calculated hypotheses for the segment boundaries and decided for a b...
متن کامل